智能论文笔记

Automatic Speech recognition for Speech Assessment of Persian Preschool Children

Amirhossein Abaskohi , Fatemeh Mortazavi , Hadi Moradi

分类：自然语言处理

2022-03-24

学龄前评估至关重要，因为它为教师和父母提供了有关儿童成长和成长的关键知识。冠状病毒大流行强调了在线评估学龄前儿童的必要性。这种在线测试需要各种技术，从Web应用程序开发到各种标准（例如语音识别）的各种人工智能模型。由于声学的波动和儿童和成人之间语音频率的差异，因此很难采用自动语音识别（ASR）系统，因为它们是在成年人的声音上预先训练的。此外，培训新模型需要大量数据。为了解决此问题，我们使用具有新的预训练目标的WAV2VEC 2.0模型为认知测试系统构建了ASR，称为随机频率音调（RFP），而我们的新数据集则在无意义的单词（MW）和New DataSet上进行了测试（MW）和快速自动命名（RAN）测试。由于这两个测试的特殊性，我们探索了许多模型，包括卷积神经网络（CNN）和WAV2VEC 2.0模型。我们的新方法在CommonVoice数据集的波斯部分上达到6.45的单词错误率（WER）。此外，我们的新方法在零和少数场景中产生积极的结果。

translated by 谷歌翻译

Russia-Ukraine war: Modeling and Clustering the Sentiments Trends of Various Countries

Hamed Vahdat-Nejad , Mohammad Ghasem Akbari , Fatemeh Salmani , Faezeh Azizi , Hamid-Reza Nili-Sani

分类：自然语言处理

2023-01-02

With Twitter's growth and popularity, a huge number of views are shared by users on various topics, making this platform a valuable information source on various political, social, and economic issues. This paper investigates English tweets on the Russia-Ukraine war to analyze trends reflecting users' opinions and sentiments regarding the conflict. The tweets' positive and negative sentiments are analyzed using a BERT-based model, and the time series associated with the frequency of positive and negative tweets for various countries is calculated. Then, we propose a method based on the neighborhood average for modeling and clustering the time series of countries. The clustering results provide valuable insight into public opinion regarding this conflict. Among other things, we can mention the similar thoughts of users from the United States, Canada, the United Kingdom, and most Western European countries versus the shared views of Eastern European, Scandinavian, Asian, and South American nations toward the conflict.

translated by 谷歌翻译

Focal-UNet: UNet-like Focal Modulation for Medical Image Segmentation

MohammadReza Naderi , MohammadHossein Givkashi , Fatemeh Piri , Nader Karimi , Shadrokh Samavi

分类：计算机视觉

2022-12-19

Recently, many attempts have been made to construct a transformer base U-shaped architecture, and new methods have been proposed that outperformed CNN-based rivals. However, serious problems such as blockiness and cropped edges in predicted masks remain because of transformers' patch partitioning operations. In this work, we propose a new U-shaped architecture for medical image segmentation with the help of the newly introduced focal modulation mechanism. The proposed architecture has asymmetric depths for the encoder and decoder. Due to the ability of the focal module to aggregate local and global features, our model could simultaneously benefit the wide receptive field of transformers and local viewing of CNNs. This helps the proposed method balance the local and global feature usage to outperform one of the most powerful transformer-based U-shaped models called Swin-UNet. We achieved a 1.68% higher DICE score and a 0.89 better HD metric on the Synapse dataset. Also, with extremely limited data, we had a 4.25% higher DICE score on the NeoPolyp dataset. Our implementations are available at: https://github.com/givkashi/Focal-UNet

translated by 谷歌翻译

Utilizing distilBert transformer model for sentiment classification of COVID-19's Persian open-text responses

Fatemeh Sadat Masoumi , Mohammad Bahrani

分类：自然语言处理

2022-12-16

The COVID-19 pandemic has caused drastic alternations in human life in all aspects. The government's laws in this regard affected the lifestyle of all people. Due to this fact studying the sentiment of individuals is essential to be aware of the future impacts of the coming pandemics. To contribute to this aim, we proposed an NLP (Natural Language Processing) model to analyze open-text answers in a survey in Persian and detect positive and negative feelings of the people in Iran. In this study, a distilBert transformer model was applied to take on this task. We deployed three approaches to perform the comparison, and our best model could gain accuracy: 0.824, Precision: 0.824, Recall: 0.798, and F1 score: 0.804.

translated by 谷歌翻译

On-device Training: A First Overview on Existing Systems

Shuai Zhu , Thiemo Voigt , JeongGil Ko , Fatemeh Rahimian

分类：机器学习

2022-12-01

The recent breakthroughs in machine learning (ML) and deep learning (DL) have enabled many new capabilities across plenty of application domains. While most existing machine learning models require large memory and computing power, efforts have been made to deploy some models on resource-constrained devices as well. There are several systems that perform inference on the device, while direct training on the device still remains a challenge. On-device training, however, is attracting more and more interest because: (1) it enables training models on local data without needing to share data over the cloud, thus enabling privacy preserving computation by design; (2) models can be refined on devices to provide personalized services and cope with model drift in order to adapt to the changes of the real-world environment; and (3) it enables the deployment of models in remote, hardly accessible locations or places without stable internet connectivity. We summarize and analyze the-state-of-art systems research to provide the first survey of on-device training from a systems perspective.

translated by 谷歌翻译

Evolutionary Deep Reinforcement Learning for Dynamic Slice Management in O-RAN

Fatemeh Lotfi , Omid Semiari , Fatemeh Afghah

分类：人工智能 | 机器学习 | 神经与进化计算

2022-08-30

需要下一代无线网络以同时满足各种服务和标准。为了解决即将到来的严格条件，开发了具有柔性设计，分解虚拟和可编程组件以及智能闭环控制等特征的新型开放式访问网络（O-RAN）。面对不断变化的情况，O-Ran切片被研究为确保网络服务质量（QoS）的关键策略。但是，必须动态控制不同的网络切片，以避免由环境快速变化引起的服务水平一致性（SLA）变化。因此，本文介绍了一个新颖的框架，能够通过智能提供的提供资源来管理网络切片。由于不同的异质环境，智能机器学习方法需要足够的探索来处理无线网络中最严厉的情况并加速收敛。为了解决这个问题，提出了一种新解决方案，基于基于进化的深度强化学习（EDRL），以加速和优化无线电访问网络（RAN）智能控制器（RIC）模块中的切片管理学习过程。为此，O-RAN切片被表示为Markov决策过程（MDP），然后最佳地解决了资源分配，以使用EDRL方法满足服务需求。在达到服务需求方面，仿真结果表明，所提出的方法的表现优于DRL基线62.2％。

translated by 谷歌翻译

HTML版本

Mismatching-Aware Unsupervised Translation Quality Estimation For Low-Resource Languages

Fatemeh Azadi , Heshaam Faili , Mohammad Javad Dousti

分类：自然语言处理

2022-07-31

翻译质量估计（QE）是预测机器翻译（MT）输出质量的任务，而无需任何参考。作为MT实际应用中的重要组成部分，这项任务已越来越受到关注。在本文中，我们首先提出了XLMRScore，这是一种基于使用XLM-Roberta（XLMR）模型计算的BertScore的简单无监督的QE方法，同时讨论了使用此方法发生的问题。接下来，我们建议两种减轻问题的方法：用未知令牌和预训练模型的跨语性对准替换未翻译的单词，以表示彼此之间的一致性单词。我们在WMT21 QE共享任务的四个低资源语言对上评估了所提出的方法，以及本文介绍的新的英语FARSI测试数据集。实验表明，我们的方法可以在两个零射击方案的监督基线中获得可比的结果，即皮尔森相关性的差异少于0.01，同时在所有低资源语言对中的平均低资源语言对中的无人看管竞争对手的平均水平超过8％的平均水平超过8％。。

translated by 谷歌翻译

Clustering Object-Centric Event Logs

Anahita Farhang Ghahfarokhi , Fatemeh Akoochekian , Fareed Zandkarimi , Wil M. P. van der Aalst

分类：人工智能

2022-07-26

流程挖掘提供了各种算法来根据事件数据分析过程执行。过程发现是过程挖掘技术的最突出类别，旨在从事件日志中发现过程模型，但是，在使用现实生活数据时会导致意大利面模型。因此，已经在传统事件日志（即带有单个情况概念的事件日志）上提出了几种聚类技术，以降低过程模型的复杂性并发现案例的均匀子集。然而，在现实生活中，尤其是在企业对企业（B2B）过程的背景下，流程中涉及多个对象。最近，已经引入了以对象为中心的事件日志（OCEL）来捕获此类过程的信息，并在OCEL的顶部开发了几种过程发现技术。然而，提出的关于真实OCEL的发现技术的输出导致更具信息性但更复杂的模型。在本文中，我们提出了一种基于聚类的方法，用于群集在OCEL中类似对象，以简化所获得的过程模型。使用对实际B2B过程的案例研究，我们证明我们的方法降低了过程模型的复杂性，并生成了对象的相干子集，这些子集有助于最终用户获得对流程的见解。

translated by 谷歌翻译

Self-Supervised Learning of Echocardiogram Videos Enables Data-Efficient Clinical Diagnosis

Gregory Holste , Evangelos K. Oikonomou , Bobak Mortazavi , Zhangyang Wang , Rohan Khera

分类：计算机视觉

2022-07-23

鉴于难以获得医学图像识别任务的高质量标签，因此需要对小标签数据集进行充分调整的深度学习技术。自我监督学习技术的最新进展表明，这种内域表示学习方法可以为监督微调提供强大的初始化，这比从监督预读的任务中比标准转移学习更为数据效率。但是，这些应用程序不适用于以视频格式捕获的医学诊断。考虑到这一进展，我们开发了一种自我监督的学习方法，该方法迎合了超声心动图视频，目的是学习强有力的表现，以诊断主动脉瓣狭窄的任务（AS），这是一种主动脉瓣的常见和危险疾病，这是主动脉瓣的常见和危险疾病。当对1％的培训数据进行微调时，我们最好的自我监督学习模型可实现0.818 AUC（95％CI：0.794，0.840），而标准转移学习方法达到0.644 AUC（95％CI：0.610，0.677）。我们还发现，我们的自我监督模型在预测严重的情况下，与显着图可视化所证明的严重相关。

translated by 谷歌翻译

Density-Aware Personalized Training for Risk Prediction in Imbalanced Medical Data

Zepeng Huo , Xiaoning Qian , Shuai Huang , Zhangyang Wang , Bobak Mortazavi

分类：机器学习

2022-07-23

由于大多数入院的患者生存，因此感兴趣的医疗事件（例如死亡率）通常以较低的速度发生。具有这种不平衡率（类密度差异）的训练模型可能会导致次优预测。传统上，这个问题是通过临时方法（例如重新采样或重新加权）来解决的，但在许多情况下的性能仍然有限。我们为此不平衡问题提出了一个培训模型的框架：1）我们首先将特征提取和分类过程分离，分别调整每个组件的训练批次，以减轻由类密度差异引起的偏差；2）我们既有密度感知的损失，又是错误分类的可学习成本矩阵。我们证明了模型在现实世界医学数据集（TOPCAT和MIMIC-III）中的改进性能，以显示与域中的基线相比，AUC-ROC，AUC-PRC，BRIER技能得分的改进。

translated by 谷歌翻译